Search CORE

427 research outputs found

OntoCAT - an integrated programming toolkit for common ontology application tasks

Author: Helen Parkinson
Morris A. Swertz
Natalja Kurbatova
Tomasz Adamusiak
Publication venue
Publication date: 18/07/2011
Field of study

OntoCAT provides high level abstraction for interacting with ontology resources including local ontology files in standard OWL and OBO formats (via OWL API) and public ontology repositories: EBI Ontology Lookup Service (OLS) and NCBO BioPortal. Each resource is wrapped behind easy to learn Java, Bioconductor/R and REST web service commands enabling reuse and integration of ontology software efforts despite variation in technologies

Crossref

Nature Precedings

PyPedia:using the wiki paradigm as crowd sourcing environment for bioinformatics protocols

Author: Kanterakis Alexandros
Kuiper Joel
Potamias George
Swertz Morris A.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2015
Field of study

Background: Today researchers can choose from many bioinformatics protocols for all types of life sciences research, computational environments and coding languages. Although the majority of these are open source, few of them possess all virtues to maximize reuse and promote reproducible science. Wikipedia has proven a great tool to disseminate information and enhance collaboration between users with varying expertise and background to author qualitative content via crowdsourcing. However, it remains an open question whether the wiki paradigm can be applied to bioinformatics protocols. Results: We piloted PyPedia, a wiki where each article is both implementation and documentation of a bioinformatics computational protocol in the python language. Hyperlinks within the wiki can be used to compose complex workflows and induce reuse. A RESTful API enables code execution outside the wiki. Initial content of PyPedia contains articles for population statistics, bioinformatics format conversions and genotype imputation. Use of the easy to learn wiki syntax effectively lowers the barriers to bring expert programmers and less computer savvy researchers on the same page. Conclusions: PyPedia demonstrates how wiki can provide a collaborative development, sharing and even execution environment for biologists and bioinformaticians that complement existing resources, useful for local and multi-center research teams. Availability: PyPedia is available online at: http://www.pypedia.com. The source code and installation instructions are available at: https://github.com/kantale/PyPedia_server. The PyPedia python library is available at: https://github.com/kantale/pypedia. PyPedia is open-source, available under the BSD 2-Clause License

Proceedings - University of Groningen

University of Groningen

Springer - Publisher Connector

ARTS repository - University of Groningen

PubMed Central

Dissertations of the University of Groningen

designGG:an R-package and web tool for the optimal design of genetical genomics experiments

Author: Breitling Rainer
Fu Jingyuan
Jansen Ritsert C.
Li Yang
Swertz Morris A.
Vera Gonzalo
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2009
Field of study

BACKGROUND: High-dimensional biomolecular profiling of genetically different individuals in one or more environmental conditions is an increasingly popular strategy for exploring the functioning of complex biological systems. The optimal design of such genetical genomics experiments in a cost-efficient and effective way is not trivial. RESULTS: This paper presents designGG, an R package for designing optimal genetical genomics experiments. A web implementation for designGG is available at http://gbic.biol.rug.nl/designGG. All software, including source code and documentation, is freely available. CONCLUSION: DesignGG allows users to intelligently select and allocate individuals to experimental units and conditions such as drug treatment. The user can maximize the power and resolution of detecting genetic, environmental and interaction effects in a genome-wide or local mode by giving more weight to genome regions of special interest, such as previously detected phenotypic quantitative trait loci. This will help to achieve high power and more accurate estimates of the effects of interesting factors, and thus yield a more reliable biological interpretation of data. DesignGG is applicable to linkage analysis of experimental crosses, e.g. recombinant inbred lines, as well as to association analysis of natural populations

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

University of Groningen

Enlighten

Diposit Digital de Documents de la UAB

The University of Manchester - Institutional Repository

Proceedings - University of Groningen

Springer - Publisher Connector

ARTS repository - University of Groningen

PubMed Central

University of Groningen Digital Archive

Dissertations of the University of Groningen

The Interaction of Genetic Predisposition and Socioeconomic Position With Type 2 Diabetes Mellitus:Cross-Sectional and Longitudinal Analyses From the Lifelines Cohort and Biobank Study

Author: Bultmann Ute
Reijneveld Sijmen A.
Snieder Harold
Swertz Morris
van der Most Peter Johannes
van Zon Sander K. R.
Publication venue: 'Ovid Technologies (Wolters Kluwer Health)'
Publication date: 01/04/2018
Field of study

OBJECTIVE: A strong genetic predisposition for type 2 diabetes mellitus (T2DM) may aggravate the negative effects of low socioeconomic position (SEP) in the etiology of the disorder. This study aimed to examine cross-sectional and longitudinal associations and interactions of a genetic risk score (GRS) and SEP with T2DM, and to investigate whether clinical and behavioral risk factors can explain these associations and interactions. METHODS: We used data from 13,027 genotyped participants from the Lifelines study. The GRS was based on single-nucleotide polymorphisms (SNPs) genome-wide associated with T2DM and was categorized into tertiles. SEP was measured as educational level. T2DM was based on biological markers, recorded medication use, and self-reports. Cross-sectional and longitudinal associations, and interactions, between the GRS and SEP on T2DM were examined. RESULTS: The combination of a high GRS and low SEP had the strongest association with T2DM in cross-sectional (OR: 3.84; 95% CI: 2.28, 6.46) and longitudinal analyses (HR: 2.71; 1.39, 5.27), compared to a low GRS and high SEP. Interaction between a high GRS and a low SEP was observed in cross-sectional (relative excess risk due to interaction: 1.85; 0.65, 3.05) but not in longitudinal analyses. Clinical and behavioral risk factors mostly explained the observed associations and interactions. CONCLUSIONS: A high GRS combined with a low SEP provides the highest risk for T2DM. These factors also exacerbated each other's impact cross-sectionally but not longitudinally. Preventive measures should target individual and contextual factors of this high-risk group to reduce the risk of T2DM

Proceedings - University of Groningen

Crossref

University of Groningen

ARTS repository - University of Groningen

Dissertations of the University of Groningen

Towards dynamic database infrastructures for mouse genetics

Author: Aidinis Vassilis
Alberts Rudi
Jansen Ritsert C.
Schofield Paul N.
Schughart Klaus
Smedley Damian
Swertz Morris A.
Wolstencroft Katy
Zouberakis Michael
Publication venue: University of Groningen, Groningen Biomolecular Sciences and Biotechnology Institute (GBB)
Publication date: 01/01/2008
Field of study

ARTS repository - University of Groningen

Towards dynamic database infrastructures for mouse genetics

Author: Aidinis Vassilis
Alberts Rudi
Jansen Ritsert C.
Schofield Paul N.
Schughart Klaus
Smedley Damian
Swertz Morris A.
Wolstencroft Katy
Zouberakis Michael
Publication venue: University of Groningen, Groningen Biomolecular Sciences and Biotechnology Institute (GBB)
Publication date: 01/01/2008
Field of study

Dissertations of the University of Groningen

Improved imputation quality of low-frequency and rare variants in European samples using the ‘Genome of The Netherlands’

Author: Deelen P. (Patrick)
et al not CWI
Marschall T. (Tobias)
Schönhuth A. (Alexander)
Swertz M.A. (Morris)
The Genome of the Netherlands Consortium
Publication venue: Nature Publishing Group
Publication date: 01/06/2014
Field of study

Although genome-wide association studies (GWAS) have identified many common variants associated with complex traits, low-frequency and rare variants have not been interrogated in a comprehensive manner. Imputation from dense reference panels, such as the 1000 Genomes Project (1000G), enables testing of ungenotyped variants for association. Here we present the results of imputation using a large, new population-specific panel: the Genome of The Netherlands (GoNL). We benchmarked the performance of the 1000G and GoNL reference sets by comparing imputation genotypes with ‘true’ genotypes typed on ImmunoChip in three European populations (Dutch, British, and Italian). GoNL showed s

CWI's Institutional Repository

Interoperability and FAIRness through a novel combination of Web technologies

Author: Bolleman Jerven T.
Bonino da Silva Santos Luiz Olavo
Ciccarese Paolo
Clark Tim
Dumontier Michel
Gavai Anand
Gray Alasdair J. G.
Kaliyaperumal Rajaram
Kelpin Fleur D. L.
Kuzniar Arnold
Schultes Erik A.
Swertz Morris A.
Thompson Mark
van Mulligen Erik M.
Verborgh Ruben
Wilkinson Mark D.
Publication venue: 'PeerJ'
Publication date: 01/01/2017
Field of study

Data in the life sciences are extremely diverse and are stored in a broad spectrum of repositories ranging from those designed for particular data types (such as KEGG for pathway data or UniProt for protein data) to those that are general-purpose (such as FigShare, Zenodo, Dataverse or EUDAT). These data have widely different levels of sensitivity and security considerations. For example, clinical observations about genetic mutations in patients are highly sensitive, while observations of species diversity are generally not. The lack of uniformity in data models from one repository to another, and in the richness and availability of metadata descriptions, makes integration and analysis of these data a manual, time-consuming task with no scalability. Here we explore a set of resource-oriented Web design patterns for data discovery, accessibility, transformation, and integration that can be implemented by any general- or special-purpose repository as a means to assist users in finding and reusing their data holdings. We show that by using off-the-shelf technologies, interoperability can be achieved atthe level of an individual spreadsheet cell. We note that the behaviours of this architecture compare favourably to the desiderata defined by the FAIR Data Principles, and can therefore represent an exemplar implementation of those principles. The proposed interoperability design patterns may be used to improve discovery and integration of both new and legacy data, maximizing the utility of all scholarly outputs

Maastricht University Research Portal

Heriot Watt Pure

Proceedings - University of Groningen

University of Groningen

ARTS repository - University of Groningen

Ghent University Academic Bibliography

Directory of Open Access Journals

Dissertations of the University of Groningen

NIPTeR:an R package for fast and accurate trisomy prediction in non-invasive prenatal testing

Author: de Boer Eddy N
de Weerd Hendrik A
Johansson Lennart F
Sijmons Rolf H
Sikkema-Raddatz Birgit
Swertz Morris A
Te Meerman Gerard J
van Dijk Freerk
Publication venue
Publication date: 01/01/2018
Field of study

BACKGROUND: Various algorithms have been developed to predict fetal trisomies using cell-free DNA in non-invasive prenatal testing (NIPT). As basis for prediction, a control group of non-trisomy samples is needed. Prediction accuracy is dependent on the characteristics of this group and can be improved by reducing variability between samples and by ensuring the control group is representative for the sample analyzed.RESULTS: NIPTeR is an open-source R Package that enables fast NIPT analysis and simple but flexible workflow creation, including variation reduction, trisomy prediction algorithms and quality control. This broad range of functions allows users to account for variability in NIPT data, calculate control group statistics and predict the presence of trisomies.CONCLUSION: NIPTeR supports laboratories processing next-generation sequencing data for NIPT in assessing data quality and determining whether a fetal trisomy is present. NIPTeR is available under the GNU LGPL v3 license and can be freely downloaded from https://github.com/molgenis/NIPTeR or CRAN.</p

Proceedings - University of Groningen

University of Groningen

ARTS repository - University of Groningen

Directory of Open Access Journals

Digitala Vetenskapliga Arkivet - Academic Archive On-line

Publikationer från Högskolan i Skövde

Dissertations of the University of Groningen

FigShare

Feasibility of predicting allele specific expression from DNA sequencing using machine learning

Author: de Klein Niek
Franke Lude H.
Sinke Richard J.
Swertz Morris A.
van der Velde K. Joeri
van Dijk Freerk
van Gijn Marielle E.
Zhang Zhenhua
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 19/05/2021
Field of study

Allele specific expression (ASE) concerns divergent expression quantity of alternative alleles and is measured by RNA sequencing. Multiple studies show that ASE plays a role in hereditary diseases by modulating penetrance or phenotype severity. However, genome diagnostics is based on DNA sequencing and therefore neglects gene expression regulation such as ASE. To take advantage of ASE in absence of RNA sequencing, it must be predicted using only DNA variation. We have constructed ASE models from BIOS (n = 3432) and GTEx (n = 369) that predict ASE using DNA features. These models are highly reproducible and comprise many different feature types, highlighting the complex regulation that underlies ASE. We applied the BIOS-trained model to population variants in three genes in which ASE plays a clinically relevant role: BRCA2, RET and NF1. This resulted in predicted ASE effects for 27 variants, of which 10 were known pathogenic variants. We demonstrated that ASE can be predicted from DNA features using machine learning. Future efforts may improve sensitivity and translate these models into a new type of genome diagnostic tool that prioritizes candidate pathogenic variants or regulators thereof for follow-up validation by RNA sequencing. All used code and machine learning models are available at GitHub and Zenodo

Proceedings - University of Groningen

University of Groningen

ARTS repository - University of Groningen

Dissertations of the University of Groningen